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ABSTRACT 

The INT/WFC Photometric Ha Survey of the Northern Galactic Plane (IPHAS) is 
an imaging survey being carried out in Ha, r' and i' filters, with the Wide Field 
Camera (WFC) on the 2.5-metre Isaac Newton Telescope (INT) to a depth of r' —20 
(lOcr). The survey is aimed at revealing the large scale organisation of the Milky Way 
and can be applied to identifying a range of stellar populations within it. Mapping 
emission line objects enables a particular focus on objects in the young and old stages 
of stellar evolution ranging from early T-Tauri stars to late planetary nebulae. In this 
paper we present the IPHAS Initial Data Release, primarily a photometric catalogue of 
about 200 million unique objects, coupled with associated image data covering about 
1,600 square degrees in three passbands. We note how access to the primary data 
products has been implemented through use of standard virtual observatory publishing 
interfaces. Simple traditional web access is provided to the main IPHAS photometric 
catalogue, in addition to a number of common catalogues (such as 2MASS) which 
© 2008 RAS are of immediate relevance. Access through the AstroGrid VO Desktop opens up the 

full range of analysis options, and allows full integration with the wider range of 
data and services available through the Virtual Observatory. The IDR represents the 
la.rfrest dataset niihli.'shed nrima.rilv thrniiffh VO interfaces tn date, a.nd .sn stand.'! a.s 
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1 INTRODUCTION 

The INT Photometric Ha Survey of the Northern Galactic 
Plane (IPHAS) is a 1800 deg^ CCD survey of the northern 
Milky Way < 5°) using the r', i' and Ha passbands (ta- 
ble[T| down to r' = 20 (Vega, lOcr for a point-like source in an 
aperture of 1.2 arcsec). The execution of IPHAS, the prop- 
erties of its characteristic colour-colour plane, a nd the likely 
scient ific scope of the survey were discussed bv lDrew et alj 
l|2005h 

This paper describes the Initial Data Release (IDR) of 
the IPHAS survey, containing observations for about 200 
million objects and comprising 2.4 Tb of processed imaging 
data. The survey is being carried out using the Wide Field 
Camera (WFC) on the INT telescope. The WFC consists of 
four 2048x4096 pixel CCDs with a pixel scale of 0.33 arcsec 
and a field of view of 0.29 square degrees. The camera CCD 
detectors are configured in an L-shape and CCD number 3 
is slightly vignetted in one corner. 

The data have already been used to carry out a series of 
investigations: the interaction between the planetary nebul a 
Sh2-188 and the inter-stellar medium (|Wareing et al.l 20061); 
the d iscovery of a new quadrupolar nebula ( Mampaso et al.l 
I2OO6I ): the study of the properti es of a sample of cataclysmic 
variables (jWitham et al.l |2006| ) ; the interacti on of the jet 
of Cy gnus X-1 with the surrounding medium (| Russell et al.l 
I2OO7I ): brown dwarf searches ( Martin et al. 2008, submit ted); 
detection of nova progenitors (|Steeghs et al.|[2007l . l2006l ): the 
study of the properties o f symbiotic stars and first discov- 
eries (|Corradi et al.ll2007l ), the study of young pre-main se- 
quence stars in CygOB2 (Vink et al. 2008, submitted), the 
presentation of a sam ple of about 5000 Ha emitting stars 
IWitham et al.l [20081 ) and the description of a method for 
selecting A star s and exploiting t hem to estimate distances 
and reddening (|Drew et al.ll2003 ). Several other investiga- 
tions are on-going and we hope the data release described 
here will lead to many data mining programs by the com- 
munity. 

In addition to the optical data provided by IPH AS, 
the Galactic Plane Survey ( GPS; iLucas et all |2007| ) of 
the UKIDSS (|d^ et al.l I2OO6I ) is also observing a similar 
area in J, H and K. This will be followed in the next 
few years by a survey of the southern Galactic Plane, 
VPHAS-I-, o n the VLT Surve y Telescope (VST) using 
OmegaCam (|Kuiiken et al.|[2004l ). It ser ves as a digital suc - 
cessor to the UK Schmidt Ha Survey (jParker et al.|[2005l ) 
and will reach about 2 magnitudes deeper. The UVEX 
project is targeting the sa me area in the U and g bands. 
The Herschel observatory (|Pilbrattl 1200 5l ) will also target 
the Galactic Plane. Together, these surveys will provide a 
better understanding of the stellar content and properties 
of our Galaxy. 

The aim of this paper is to provide a guide to the Ini- 
tial Data Release of the IPHAS survey, describe the data 
products and the data properties and usage. Section [2] sum- 
marises the observations, sky coverage and data quality. The 
data processing is briefly described in section [3] The data 
products contained in this release are described in section [l] 
Section [5] shows an example usage of the catalogue to ex- 
tract sources detected in a large area of sky and cross match 
with the GPS survey. Finally in section [6] we describe the 
different user access interfaces to the data. 



Filter 


Cen Wave (A) 


Width (A) 


AB (mag) 


r' 


6254.1 


1350.0 


0.164 


Ha 


6568.3 


96.0 


0.352 


i' 


7771.9 


1490.0 


0.413 



Table 1. Fihcr parameters. The second column lists the effec- 
tive wavelength of the filter and the third its width. The last 
column lists the value to bo added to the Vega magnitude to con- 
vert it to AB. Filter profiles arc available from the ING web site 
(www.ing.iac.es) and discussed in Drew et al. 2005. 



2 SKY COVERAGE AND DATA QUALITY 

The IDR contains 1,600 deg^ of imaging data in three bands 
covering the northern galactic plane at latitudes |6| < 5° and 
longitudes 30° < ^ < 220° . Figure [1] shows the distribution 
in the sky. The data have been obtained between August 
2003 and December 2005 during a total of 212 nights (about 
26 per cent of this observing time has been completely lost 
due to bad weather). 

The area to be observed has been divided into "fields" 
each of them corresponding to the coverage of the WFC. 
Fields have been defined in a diagonal rather than rectan- 
gular geometry in order to take advantage of the L-shape of 
the detector configuration allowing for a 5 per cent overlap 
between adjacent pointings. Figure [2] provides an example 
of the observation configuration. 

Gaps between detectors are typically 20" . Therefore the 
area not observed in each pointing due to the gaps is about 
12 square arcmin (i.e. ~1 per cent). In order to cover the 
chip gaps and bad columns each "field" is observed in pairs 
which are referred to as "on" and "off" positions with an 
offset of 5'W and 5'S between the pointings. Most of the 
objects are thus observed twice. Field pairs are observed 
consecutively in the three bands before moving to the next 
field pair in order to minimize the effects of photometric 
variability. Fields with mean stellar seeing values larger than 
2 arcsec, ellipticity larger than 0.2, r'-band sky counts larger 
than 2000 ADUs (which indicates observations too close to 
the moon or with clouds such that sensitivity is reduced to a 
significantly lower magnitude limit) or anomalous star ratios 
between the filters (which indicates variable clouds during 
observations) are scheduled for re-observation in the three 
filters. Each field pair is assigned an identification which we 
will refer to as "Field number". The average stellar counts 
is ~20,000 per square degree although this value is highly 
dependent on the position along the Galactic plane as seen 
in figure |3] which shows the number of sources detected in 
each field as a function of Galactic longitude. 

The overall survey properties and quality of the data is 
summarised in table [2] The exposure time for each filter is 
30 s in r' (except in the first year of observations, 2003, when 
it was 10 s), 10 s in i' and 120 s in Ha. All data obtained up 
to December 2005 have been included in the IDR regardless 
of seeing or photometric conditions. Figure |4] shows the see- 
ing values for these data in the r' band. Although the mean 
value is about 1.26 arcsec (and the median 1.18 arcsec) there 
are clearly observations obtained in poorer conditions. The 
maximum acceptable seeing value for the data to be ac- 
cepted as final survey quality is 2 arcsec in the three bands. 
Data with larger seeing values have been included in this 
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Figure 1. IPHAS Initial Data Release sky coverage in galactic coordinates (degrees). The top figure displays the coverage of the full 
release and the bottom one the coverage of the PhotoObjBest best observations subset. Brightness indicates the total line-of-sight 
reddening as computed from the Schlegel et al. (1998) dust maps using 4 pixel interpolation. 



44.0 





310.5 



Figure 2. Tiling configuration of 3 X 3 pointings covering an area 
of ~2 square degrees. Chips are outlined in black. Field numbers 
are also shown. An additional pattern with an offset of 5' X 5' is 
not included in order to make the figure clearer. 



Figure 3. Source counts as a function of Galactic longitude (one 
data point per IPHAS field). 



release but they do not appear by default to the user. For 
this purpose we are releasing a full catalogue, PhotoObj, 
with all the observations obtained and a PhotoObjBest 
subset which only contains the observations which pass our 
scientific criteria for this release (see section 14.21 for more 
details). As noted in table [5] the coverage of this latter cat- 
alogue is about 1,000 square degrees and contains about 90 
million unique objects. The Final Data Release of the data 
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Figure 4. Seeing distribution in the r' band for the whole IDR. 
One of the criteria for inclusion in the PhotoObjBest catalogue 
is that objects have to be observed with seeing conditions better 
than 2.0 in the three bands (left of the dashed line). 
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Figure 5. Magnitude limit distribution for the IDR in the three 
survey bands. The r' band distribution has been split in two, the 
leftmost corresponding to the 2003 observations and the right- 
most corresponding to post-2003 observations in which the expo- 
sure time was increased from 10 s to 30 s. 

will extend the defined scientific criteria to the whole 1,800 
square degrees. Figure [5] shows tfie magnitude limit (5(t) for 
all the objects in the three survey bands. Note that r' ob- 
servations obtained in 2003 are shallower due to the smaller 
integration time used. 



3 DATA PROCESSING 

All the observations obtained with the INT telescope at La 
Palma are transmitted in pseudo-real time to the Cambridge 
Astronomy Survey Unit (CASU) for data archival via the 
internet. Although quasi real time data processing is possi- 
ble it is preferable to perform the data processing on a run 
by run basis. The calibration is then performed using the 
whole run master flats and biases. For each observation the 
data are stored in a Multi-Extension Fits file (MEF) with a 



primary header describing the overall characteristics of the 
observation (pointing, filter, exposure time, ...) and four im- 
age extensions corresponding to each of the CCD detectors. 

The data are pr ocessed by CASU as described in 
llrwin and Le^ l|200lf ) following the sa me steps as for the 
processing of the W ide Field Survey (jlrwin et al.l I2OO5I : 
iMcMahon et al.ll200ll ) which used the same telescope and 
camera combination. We provide here a short description 
of the reduction s teps; additional details are available from 
iDrew et al.1 (|2005h . 

The data are first debiassed (full 2D bias removal is 
necessary). Bad pixels and columns are then flagged and 
recorded in confidence maps, which are used during cata- 
logue generation. Linearity tests using sequences of dome 
fiats revealed that the CCDs have significant non-linearities 
at the level of 1 to 2 per cent depending on the detector and 
period of observation. Due to several changes to the WFC 
controllers the linearity values change from time to time so 
linearity measurements are updated periodically. A linearity 
correction using look-up-tables is then applied to all data. 
Flatfield images in each band are constructed by combining 
a series of sky fiats obtained in bright sky conditions during 

the twilight for each run. 

The standard catalogue generation software (|lrwinl 

Il985h makes direct use of the confidence maps previously 
generated (described in l4.1.ip for a variety of tasks. These in- 
clude object detection and parametrization, producing qual- 
ity control information, standard object descriptors and de- 
tected object overlay files. The possibly varying sky back- 
ground is estimated automatically, prior to object detection, 
using a combination of robust iteratively clipped estimators. 
The image catalogues are then further processed to yield 
morphological classification for detected objects and used 
to generate astrometric and photometric calibration infor- 
mation. 

Each detected object has an attached set of descriptors, 
forming the columns of the binary table and summarising 
derived position, shape and intensity information. During 
further processing stages ancillary information such as the 
sky properties, seeing and mean stellar image ellipticity, are 
derived from the catalogues and stored in the FITS headers 
attached to each catalogue extension. In addition to being 
the primary astronomical products from the pipeline pro- 
cessing, the catalogues and associated derived summary in- 
formation form the basis for astrometric and photometric 
calibration and quality control monitoring. 

Finally an astrometric solution starts with a rough 
World Coordinate System (WCS) based on the known tele- 
scope and camera geometry and is then progressively re- 
fined using the 2MASS catalogue. The WFC field distor- 
tion is modelled using a zenithal eq uidistant proje ction with 
polyn omial radial distortion (ZPN; ICreisen and C alabrc ttal 
12002! ) . The resulting internal astrometric precision is better 
than 100 mas over the whole WFC array (based on intercom- 
parison of overlap regions; see section[3]3} . The object detec- 
tion is performed in each band separately using a standard 
APM-style object detection and parametrization algorithm 
(|lrwinlll985h . The curve-of-growth, defined as the flux inside 
an aperture as a function of its radius, is calculated measur- 
ing the flux in a set of apertures of radius r/2, r, \/2r, 2r, 
2\/2r where r = 3.5 pixels (1.2 arcsec) and an automatic 
aperture correction (based on the average curve-of-growth 
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Parameter 



Value 



Telescope 
Instrument 
Field of view 
Detector scale 
Total area of survey 
Total area in IDR 

Area included in the PhotoObjBest catalogue 

Number of unique objects in the full catalogue 

Number of unique objects in the PhotoObjBest catalogue 

Filters 

Exposure time 
FWHM (median) 
Magnitude limit (Vega, 5cr) 
Astrometry 



2.5m Isaac Newton Telescope, La Palma 
Wide Field Camera 
0.29 sq. deg. (per pointing) 
0.33 arcsec/pixcl 
~1800 sq. deg 
~1,600 sq. deg 
~1000 sq. deg 
~200,000,000 
~90,000,000 
r', i'. Ha 
30s (r')t, 10s (i'), 
1.3" (r'), 1.2" (i'), 
21.8 (r'), 20.2 («'), 



120s (Ha) 
1.3" (Ha) 
20.6 (Ha) 



<0.1" (based on 2MASS) 



10 s in 2003 



Table 2. Survey parameters and IDR quality summary. 



for stellar images) is applied to all detected objects. Finally 
a distortion correction is applied to the photometry to take 
into account the change of scale with respect to the optical 
axis distance (see subsection 14. ip . 

It is straightforward to show that an aperture of radius 
~FWHM delivers optimal (i.e. 90 per cent of the tot al flux) 
phot ometry compared to more detailed PSF fitting (|Navloil 
1 19981 ) in non crowded areas. The default photometry is then 
done in an aperture of 1.2" and corrected for hght loses out- 
side the aperture. After adding some extra refinements to 
deal with the most crowded regions we find that there is 
little to be gained from PSF fitting (Irwin et al. 2008, in 
prep). The difference between crowded and non crowded ar- 
eas is shown in figure |6] which represents the nearest neigh- 
bour source distribution for two different fields: one with 
high source density to illustrate the worst case scenario of 
a crowded field and another with an average source density. 
For comparison we use the nearest neighbour probability 
density distribution (PD F) expected for a random sample 
which can be written as (|Bahcall and Soneiralll98ll ) 

Pjviv(6») = 2 7rpee-"''*' (1) 

where p is the average density and ivpO^ is the number 
of neighbours in the interval {0,tv9^). The expected mean 
nearest neighbour se paration can then be calculated as 
l|Scott and Toutlll989l ) 



*Pivjv(6')d6i= — 
2^ 



(2) 



For the space densities of the fields plotted in figure|6]we ob- 
tain {8} = 2.5" for the crowded field and { 61 ) = 4.7" for the 
average density field. The maximum of the Pnn distribution 
is given by 6,n — l/-^27rp and corresponds to 6m = 2.0" and 
9m = 3.8" in the crowded and average density fields re- 
spectively. The large discrepancy between the random and 
observed distributions for the high density field at 6' > 2" is 
due to missing close pairs which produces an overestimation 
of the nearest neighbours at S ~ 2". We can calculate the 
fraction of miss ed objects due to crowdi ng using the follow- 
ing expression (|lrwin and Trimbl3l 19841 ) 



Pcorr — 



log{l - 



where Pcorr is the corrected average density and Ofwhm the 
FWHM of the images(i.e. 1.2 arcsec). For the average den- 
sity field the fraction of sources lost due to close pairs is 9 
per cent while in the crowded field this percentage increases 
to 41 per cent. Note as well that non resolved objects pair, 
on average, with objects at a distance ~ {9} occupying, 
and overestimating, the central bins of the nearest neigh- 
bour distribution (figure [6] top). We also plot in figure |6] the 
corrected nearest neighbour PDF as a dotted line using the 
corrected average density given by equation [3] 

Note that the crowded field case represents a extreme 
case which has source density more than three times higher 
than the median source density of the survey. 

The values obtained for the mean nearest neighbour 
separations are clearly well above our seeing values in both 
cases indicating that aperture photometry is appropriate for 



our case. 



(3) 



3.1 Morphological Classification 

The morphological classification is based primarily on the 
aperture fluxes and the discrete curve-of-growth for each de- 
tected object defined by them. Ancilliary information such 
as object ellipticity derived from intensity-weighted second 
moments is also used but only as a secondary indicator. The 
curve-of-growth is a direct measure of the integral of the 
point spread function (PSF) out to various radii and is in- 
dependent of magnitude if the data are properly linearised, 
and if saturated images are excluded. In using this prop- 
erty the classifier further assumes that the effective PSF for 
stellar objects is constant over each detectorQ although in- 
dividual detectors are allowed to have different PSFs since 
all detectors, pointings and passbands are treated indepen- 
dently. 

The average stellar locus on each detector in these 
parameter spaces is generally well-defined and is used as 
the basis for a null hypothesis stellarness test. Figure [7] 



^ In practice we find that the effects of the spatial variation of 
the PSF on the aperture fluxes at the detector level are generally 
negligible. 
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Figure 6. Normalised nearest neighbour source distribution for 
a crowded field (top; 140 sources per arcmin'^, 146160 sources in 
total) and for an average source density field (bottom; 40 sources 
per arcmin^ ; 40760 sources in total) both observed with seeing of 
1.2 arcsec. The dashed line shows the distribution for a random 
sample (equation [TJ using the observed source density while the 
dotted line shows the expectation when the average source density 
is corrected for incompleteness. Close pairs are missing due to 
the seeing and this is reflected in the smaller number of pairs 
with respect to the Poisson expectation at seeing smaller than 
2 arcsec. This effect is larger in the crowded field leading to a 
large difference between the observed and expected normalized 
distributions. 



shows an example of this comparing the magnitude differ- 
ence recorded in two apertures (1.2 and 2.4 arcsec diameter) 
as a function of the derived overall magnitude and illustrates 
how this relates to the final classification for this frame. 

In practice, the aperture with radius r = 1.2 arcsec is 
used as a fixed reference and also defines the internal mag- 
nitude (flux) scale. The linearity of the system implies that 
the position of the stellar locus for any function of the aper- 
ture fluxes is independent of magnitude (at least until im- 
ages saturate). Therefore marginalising the flux ratios over 
magnitude yields one-dimensional distributions that can be 
used to greatly simplify locating the stellar locus using an 
iteratively clipped median. With the location estimated, the 
median of the absolute deviation from the median (MAD) 
provides a robust measure of the scatter about this locus 




o 



r magnitude 

Figure 7. An example of how the difference in flux recorded be- 
tween pairs of apertures is used to provide information for mor- 
phological classification. The stellar locus (black points) is self- 
defining at a constant offset (here 0.73 magnitudes) while the 
spread of points about this locus, after iterative clipping, defines 
the average scatter of stellar objects as a function of magnitude. 
Grey points denote noise-like objects that lie well below the stel- 
lar locus ridge, or non-stellar or blended stellar objects which lie 
above the stellar population. Saturated stars form the upturned 
sequence brighter than 13th magnitude. The offset of the top 
dashed line defines the average differential aperture correction for 
stellar objects for this pair of apertures and in combination with 
other pairs of apertures defines the overall stellar curvc-of-growth 
and hence individual stellar aperture corrections. 



as a function of magnitude, at least until galaxies dominate 
in number. This process is repeated iteratively for each dis- 
tribution, using 3-sigma clipping to remove non-stellar out- 
liers, until satisfactory convergence is reached. After con- 
vergence the equivalent Gaussian sigma is estimated using 
lygauss ~ 1A8AIAD and by this means each of the im- 
age shape descriptors (in this case flux ratios or ellipticity) 
can be renormalised to follow a zero-median, unit variance 
Gaussian-like N{0, 1) distribution. 

These measures are then directly combined to form the 
final classification statistic. The combination (essentially a 
unweighted sum of the normalised signed distributions) is 
designed to preserve information on the "sharpness" of the 
object profile and is finally renormalised, as a function of 
magnitude, to produce the equivalent of an overall A''(0, 1) 
classification statistic. 

In practice measures derived from real images do not 
exactly follow Gaussian distributions. However, by combin- 
ing multiple normalised distributions (with well-defined 1st 
and 2nd moments) , the Central Limit Theorem works in our 
favour such that the resulting overall statistic is Gaussian- 
like to a reasonable approximation and hence can be used 
with due care as the likelihood component of a Bayesian 
Classification scheme, making optional use of prior knowl- 
edge. 

Objects lying within 2-3(j of the stellar locus (i.e. of 
zero) are generally fiagged as stellar images, those below - 
3 to -5(T (i.e. sharper) as noise-like, and those above 2-3cr 
(i.e. more diffuse) as non-stellar. Although the discrete clas- 
sification scheme is based on the A^(0, 1) measure of stellar 
appearance it also has several overrides built in to attempt 
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to make it more reliable. For example, adjustments to the 
boundaries at the faint-end (to cope with increased rms 
noise in the statistic) and at the bright-end (to cope with 
saturation effects) are also made, while the overall image 
ellipticity provides a further check. 

A by-product of the curve-of-growth analysis and the 
classification is an estimate of the average PSF aperture 
correction for each detector for those apertures (up to and 
including 4r, which includes typically ~99 per cent, or more, 
of the total stellar flux) used in deriving the classification 
statistic. Accurate assessment of the aperture correction to 
place the (stellar) fluxes on a total fiux scale is a crucial com- 
ponent of the overall calibration. We find that this method 
of deriving aperture corrections contributes ±1 per cent 
to the overall photometry error budget and also provides a 
useful first order seeing correction for non-stellar sources. 
Further by-products of the morphological classification pro- 
cess are improved estimates of the seeing and average PSF 
ellipticity from making better use of well-defined stellar-only 
sources. These parameters are required for quality control 
monitoring of telescope performance and "atmospheric" see- 
ing. 

3.2 Photometric Calibration 

Photometri c calibration is done using series of Landolt stan- 
dard stars (|Landoltl Il992l l with photometry in the SDSS 
system with additional stars from IStetsonI (|2000l ). For each 
night a zero point and error estimate using the observations 
of all the standard fields in each filter is derived. For pho- 
tometric nights the calibration over the whole mosaic has 
an accuracy of 1-2 per cent. For the purpose of the pho- 
tometric calibration, standard star observations have been 
obtained each night at an interval of 2 h and have been used 
to calibrate the r' and i' frames. The Ha frames have been 
calibrated using a fixed zero-point offset of 3.14 magnitudes 
with respect to r', corresponding to the magnitude differ- 
ence in r'— Ha for Vega. This value h as been calcu lated 
by convolving the HST Vega spectrum (|Bohlinll2007h with 
the atmospheric throughput, INT optical throughput (pri- 
mary mirror and field corrector), CCD efficiency and the 
WFC filter curves (available from the INC web pages at 
www.ing.iac.es) so that r' — Ha = for Vega. 

All calibration is by default corrected during pipeline 
processing for the mean atmospheric extinction at La Palma 
(0.09 in r' and Ha and 0.05 in i' per unit airmass). 

During non-photometric nights, in otherwise acceptable 
observing conditions, we find that the derived zero point sys- 
tematic errors can be up to 10 per cent or more. Although 
the pipeline usually successfully flags such nights as non- 
photometric (see figure |5} it still leaves open the problem 
of what to do about tracking the varying extinction dur- 
ing these nights. For this Initial Data Release we have not 
included a global photometric solution for the whole sur- 
vey. While it is possible to perform an offset to a common 
reference system for a small subset of the observations us- 
ing overlaps, as demonstrated below in section [S] the ap- 
plication of such methods to the whole survey area is more 
complicated since one has to carefully select good reference 
frames and allow for variation in parameters like extinction 
or zero point over the whole survey period. Availability of 
good photometric observations to tie the solution as much as 
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Figure 8. Photometric zero point (ZP) versus date for two runs. 
The dashed lines are the expected values for a good photometric 
night in r' (red), i' (blue) and Ha (green). Dots represent the 
individual measurements for each frame. The top figure shows 
a good photometric run in which all the zero points are consis- 
tent with the expected values. The bottom one instead shows 
a variable run in which a large percentage of observations have 
been obtained in bad photometric conditions and therefore not 
included in the PhotoObjBest catalogue. These plots are used 
to flag problematic frames and runs. 



possible is crucial. Clearly this is a major task which we are 
currently investigating and is best accomplished when the 
survey is complete. Data from non-photometric nights have 
been included in the release but flagged as such so that they 
do not appear in our PhotoObjBest catalogue. Figure [9] 
shows the mean magnitude rms for the objects included in 
this catalogue, zero point uncertainties are at the level of 2 
per cent for r' and i' and 3 per cent for Ha. We also plot 
in flgure[TO]the stellar counts in the three bands along two 
different line of sights in the Galactic plane with different 
reddening. 

All magnitudes quoted are in the Vega system but we 
give the conversion factors to the AB system in table [1] 



3.3 Astrometric Calibration 

Astrometric calibration is a multi-stage process and aims 
to provide each image, and any derived catalogues, with a 
World Coordinate System (WCS) to convert between pixel 
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Figure 9. Photometric measurement uncertainties as a function 
of magnitude in 0.5 mag bins for all the objects included in Pho- 
toObjBest. The vertical error bars show the rms dispersion in 
the mean uncertainties in each brightness bin and the horizon- 
tal lines in each panel show the SNR=10 (dotted) and SNR=5 
(dashed) values. 



and celestial coordinates. This happens in the pipeline in 
two generic stages. 

An initial WCS based on knowledge of the instrument, 
e.g. orientation, field-scale, telescope pointing, is embedded 
in the FITS headers, with telescope-specific information in 
the primary header and detector-specific information in the 
secondary headers. This serves to locate each detector image 
to within a few to several arcsec, depending on the pointing 
accuracy of the telescope and model parameters. The essen- 
tial information required is the RA and Dec of the pointing, 
a (stable) reference point on the detector grid for those coor- 
dinates {e.g. the optical axis of the instrument), the central 
pixel scale, the rotation of the camera, the relative orienta- 
tion of each detector and the geometrical distortion of the 
telescope and camera optics, which defines the astrometric 
projection to use. 

Given a rough WCS for the processed frames, a more 
accurate WCS can be defined using astrometric standards. 
We have based our calibration on the 2MASS point source 
catalog ISkrutskie et al.l |2006| ) for several reasons: it is an 
all-sky NIR survey; it is calibrated on the International Ce- 
lestial Reference System (ICRS); it provides at least 100 or 
more suitable standards per pointing; it is a relatively recent 
epoch (mid-1990s) minimising proper motion problems; the 
glob al systematics are bet ter than 100 mas over the entire 
sky (jZacharias et al.ll2003l ): and for 2MASS point sources 
with signahtomoise >10:1 the rms accuracy per source is 
< 100 mas. 

By stacking the astrometric residuals from a series of 
independent pointings and CCD WCS solutions it is possible 
to assess the accuracy of the simple INT distortion model. 
This is illustrated in the figure [11] using the average residuals 
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Figure 10. Star counts along two contrasted lines of sight with 
star densities of '^50,000 deg'^ and different reddening. 



from a stack of a one week run. The end product of the 
full pipeline currently has an astrometric precision better 
than 100 mas over the whole array (ie. across CCDs), as 
determined by analysis of independently calibrated adjacent 
overlapping pointings. 



4 DATA PRODUCTS 
4.1 Images 

There are two types of imaging data products: (1) object 
images and (2) confidence maps. At this point we do not 
provide stacked images from the overlapping frames. We de- 
scribe each of them separately. 

As stated before, each image is a MEF file containing 
the four CCD frames. Each CCD is photometrically and as- 
trometrically calibrated as described in the previous section. 
Photometry can be extracted from the images using the fol- 
lowing expression in the instrumental system 



m = ZP - K (X - 1) - 2.5 logio(//t) + c 



(4) 



where m is the calibrated magnitude, ZP is the zero point 
at unit airmass as extracted from the image headers, k is 
the extinction coefficient, X is the airmass, / is the flux in 
counts and t is the exposure time. The c correction is based 
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Figure 11. Astromctric residuals derived from a series of in- 
dependent WFC pointings. The residuals are shown in physical 
coordinates with respect to the center of the optical axis. 

on the median dark sky recorded in science frames compared 
to the median for all the CCDs and as such is an ancillary 
correction to the gain correction derived from the flatfield 
(usually twilight flats). This correction is in the header key- 
word PERCORR and is typically smaller than 1 per cent and 
also corrects for the small quantum efficiency differences be- 
tween the detectors. 

Astrometric information is written in the FITS headers 
using a ZPN projection which allows for a description of the 
non uniform scale over the field of view without performing 
pixel resampling (necessary e.g. in the case of describing the 
distortions using a more usual tangential projection). 

The aim of conventional flatfielding is to create a flat 
background by normalising out perceived variations from 
uniformly illuminated frames. If the sky area per pixel 
changes then this is reflected in a systematic error in the 
derived photometry. However the change of scale, i.e., the 
change of sky area per pixel, also creates photometric ef- 
fects. The magnitude of the multiplicative correction to be 
applied to the measured flux can be modelled as (Irwin et 
al. 2008, in prep) 

where R is the distance from the optical axis and for the 
WFC the coefficients are Pi = 1 and P3 = 220 (correspond- 
ing to the PV2_1 and Py2_3 WCS keywords as defined in 
ICreisen and Calabrettal (|2002l )'l. Figure [T2l shows the effect 
of this term in the photometry which is at a maximum level 
of only 2 per cent at the outer parts of the frames. The 
value displayed in the figure has to be subtracted from the 
magnitude calculated in equation |4] and is automatically in- 
cluded in the catalogue data products but has to be applied 
by the user when measuring fluxes from the images. Table[3] 
lists the main photometric keywords present in the image 



0.030 




R (arcmin) 



Figure 12. Effect of scale change in photometry (magnitude ver- 
sus distance to field centre). 



Keyword 


Description 


MAGZPT 


[mag] Magnitude zero point for default extinction 




measured at unit airmass 


MAGZRR 


[mag] Error in magnitude zero point 


AIRMASS 


Airmass at start of observation 


EXPTIME 


[sec] Exposure time 


WFFBAND 


Filter bandpass 


SKYLEVEL 


[counts/pix] Median sky brightness 


SKYNOISE 


[counts] Pixel noise at sky level 


SEEING 


[pixels] Mean stellar FWHM 


APCOR 


[mag] Stellar aperture correction 


PERCORR 


[mag] Sky calibration correction 


NUMBRMS 


Number of objects used when computing 




the astrometric fit 


STDCRMS 


[arcsec] Astrometric fit error 



Table 3. Subset of relevant photometric and astrometric key- 
words available from the image headers after pipeline processing. 



headers and needed to convert the fluxes to magnitudes as 
explained in this section. 

J^.l.l Confidence maps 

We deflne a confidence dj map, where j refers to pixel j of 
frame i, that is an inverse variance weight map in units of 
relative variance denoting the "confidence" associated with 
the flux value in each pixel. The Cij are normalised to a 
median level of 100 per cent (i.e. {cij) = 1). This has the 
advantage that the same map can also be used to encode for 
hot, bad or dead pixels, by assigning zero confldence. 

The confidence map for each frame is derived from anal- 
ysis of the fiat fields and is unique for each filter/detector 
combination. It encodes individual pixel sensitivities and al- 
lows for vignetted or bad regions to be masked. These con- 
fidence maps are used in the object detection phase of the 
processing. 

Use of the confidence maps for weighted co-addition of 
frames, or for object detection, simply requires an overall 
estimate of the average noise properties of the frame. This 
can be readily derived from the measured sky noise, in the 
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Poisson noise-limited case, or from a combination of this and 
the known system characteristics {e.g. gain, readout noise). 
All processed frames have an associated derived confidence 
map which is propagated through the processing chain. A 
detailed description of the process is given in Irwin et al. 
2008, in prep. 

4.2 Catalogues 

The object catalogue is made available using a relational 
database system. We have followed the philosophy of reduc- 
ing the number of tables presented to the user as much as 
possible. The database contains four tables which are sum- 
marised below. 

• Fields All (table[4|. Information about the field centres 
and number of objects detected. Each field pair has associ- 
ated a unique field number (f ieldno) and each of those an 
on/off position (onoff where a value of has been assigned 
to the off position and 1 to the on position). These two are 
combined into the f ieldID to provide a unique identifier for 
each pointing. Equatorial and galactic coordinates for each 
pointing as well as number of detections are also provided. 

• Chips All (table [SJ. This table contains parameters 
about the observation (airmass, exposure time. World Co- 
ordinate System) as well as the Data Quality Control at- 
tributes for each chip, i.e, mean seeing and ellipticity, zero 
point, sky level, etc. Also for each pointing the Galactic co- 
ordinates are available as well as t he Galactic extinctio n as 
measured from the Schlegel maps (jSchlegel et al.|[l99^ ) us- 
ing a 4 pixel linear interpolation for each position. 

• PhotoObj (table [6]). Main catalogue containing all de- 
tections with image derived parameters and calibrated mag- 
nitudes. Each row has a set of keys (chip_r_id, chip_i_id, 
chip_ha_id, field_id) which allow the user to enquire 
about the photometric conditions (via the ChipsAll ta- 
ble) under which the measurements of a particular object 
have been done or to which field it belongs. Additional flags 
(e.g seeing_f lag) allow the selection of objects observed in 
particular (seeing) conditions. We have also included the Hi- 
erarchical Triangular Mesh index (htmid) as deflned by the 
Sloan Digital Survey Science Archive 0. The total number 
of detections in this catalogue is 411 million including du- 
plicate observations. Due to the on/off observation strategy 
which leads to every source being observed twice this means 
about 200 million unique objects. 

• PhotoObjBest (table [6]). Subset of PhotoObj with 
additional constraints used to select only those observations 
obtained in good photometric conditions and thus will vary 
little between the IDR and the Final Data Release. The 
data obtained in the Aug 2003 and Dec 2004 observing runs 
have been discarded due to poor photometric conditions. 
This catalogue contains only observations for those nights 
for which the mean magnitude zero point rms in the r' band 
is better than 5 per cent, the mean r' zero point is greater 
than 24.2 and the individual frames have been obtained in 
seeing conditions better than 2 arcsec and ellipticity bet- 
ter than 0.2. Additionally only sources detected in all three 
bands have been included. The total number of detections 

see |http: / / skyserver.org/htm/ 1 





lype 


JJGScriptiori 


ficldID* 


long 


Field ID 


onofT* 


int 


on/ofT observation flag 


ficldno* 


int 


Field number 


ra* 


float 


R.A. of centre of field (deg, J2000) 


dec* 


float 


Dec of centre of field (deg, J2000) 


glon 


float 


Galactic longitude of centre of field (deg) 


glat 


float 


Galactic latitude of centre of field (deg) 


nobj 


int 


Number of objects detected 



Table 4. Column names, types and description of FieldsAll table. 
Columns marked with an asterisk are indexed. 



in this catalogue is 181 million (approximately 90 million 
unique objects). 

• tGetNearbyObj. This is a store procedure to perform 
a cone search without having to use the complete haversine 
formula explicitly to calculate angular distances. Given the 
coordinates (ra, dec) in degrees and a radius (r) in arcmin 
it returns a list of identifiers for the objects in such defined 
cone. The output can then be joined with one of the Pho- 
toObj/Best tables to return the object parameters (see 
example in appendix A). 

Tables |4l [5] and [6] list the column names and their de- 
scription. Columns marked with an asterisk are indexed by 
the database software, i.e., searches in those columns are 
particularly fast. Users should use when possible at least 
one indexed column in order to speed up the queries. 

Each entry in each table is assigned a unique ID number 
used for cross reference. Figure [13] shows an object ID in 
the PhotoObj /Best tables and how it is decoded. Each 
entry in the ChipsAll table is encoded in the same way 
but without the last five figures which represent the object 
number. 

Individual detection star/galaxy classifications are com- 
bined into one single value followin g a similar approach to 
the WFCAM Science Archive (WSA: iHamblv et al.ll2007l l. A 
classification table is defined which assigns probability val- 
ues for each classification code and then each classification is 
combined for a merged source using Bayesian classification 
rules assuming each value is independent: 

where Ck is the classification fiag and i denotes the z"^ 
single detection passband measurement available. Decision 
thresholds for the resulting discrete classification flag are 
90 per cent for definitive and 70 per cent for probable. An 
additional decision rule enforces class_fiag=255 (saturated) 
when any individual classification flag indicates saturation. 



5 CROSS MATCHING WITH UKIDSS GPS 

In iDrew et al.l (|2005l ) we describe the importance of the 
IPHAS data by themselves and the several papers published 
to date using just IPHAS data are clearly a demonstration 
of that. In this section however, we want to focus in the 
multiwavelength aspect which the present IPHAS release 
will facilitate. We provide an example of cross matching 
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Column Name 


Type 


Description 


cnipiij 


long 


cmp lu 


runiio* 


int 


Run number 




float 


r\..-ri-. Oi Llic Cliip CcIiLcl I ClCg, J ZUUU 1 


dec 


float 


Dec of the chip center (deg, J2000) 


glon 


float 


Galactic longitude (deg) 


glat 


noat 


Galactic latitude (deg) 


ebv 


real 


E(^B — V^) from Schlegel dust map 


naxis 1 


iiit 


Chip size along x axis (pixels) 


iia,xis2 


int 


Chip size along y axis (pixels) 


mjd 


noat 


MJD of observation 


majgzptr * 


real 


F*hotometric zero point (Vega) 


magzrr 


real 


Zero point error (Vega) 


extinct 


real 


Atmospheric Extinction 


airmass* 


real 


Airmass 


exptime 


real 


Exposure time (seconds) 


night* 


mt 


iNignt OI ODservation i^Y Y Y i NlNiUU ) 


dateobs* 


int 


jJate or ouservation (^Y Y Y i NiNlUU ) 


band* 


char 


Filter name 


maglim* 


real 


Magnitude limit (5fT, Vega) 


seeing* 


real 


Mean seeing (arcscc) 


elliptic* 


real 


M!ean stellar ellipticity 


CQ1_± 


noat 


\_j 1 J iviaTirix 


cdl 1 


float 


C*T) A/fnfriv 


cdl 1 


float 


V_. 1 J lVi.(HillJt 


cdl 1 


float 






float 


R.A. of reference pixel (deg) 




float 


J.-'Cl. Ul 1 CltJl t;liL,tJ UlJtt;! ILlCtf 


crpixl 


noat 


X coordinate of reference pixel 


crpix2 


float 


y coordinate of reference pixel 


percorr 


real 


Sky correction (Vega) 


apcor 


real 


Aperture correction (Vega) 


skylevel 


real 


Sky level (counts) 


skynoise 


real 


Sky noise (counts) 


nobj 


real 


number of detected objects 


ficldJd* 


long 


link to FieldsAU table 



Table 5. Column names, types and description of data quality 
control tabic ChipsAU. Columns marked with an asterisk are in- 
dexed. 



Field Date Obs. CCD 




I I ' 1 ' 

On/Off Flag Band Object 

Figure 13. The unique object ID encodes the field number, 
on/off flag (l=on, O=off), date of observation (in the format 
YYMMDD), band (l=r'; 2=j'; 3=Ha) and CCD number. 

the sources extracted from the IPHAS catalogue with those 
from the UKDSS GPS. Near infrared data are required to 
probe the many parts of the galaxy which suffer high extinc- 
tion and an ESO-public near infrared survey of the northern 
plane, the UKIDSS Galactic Plane Survey, has recently be- 
gun. 

We have selected a contiguous area of 2 square degrees 
centred at a = 20*" 48™, S = 42°45' {I, b = 83.0, -0.5) in the 
Cygnus-X region and extracted all objects from the Pho- 
toObjBest catalogue. From the lSchlegel et aP ()l998l l maps 
the extinction is E{B — V) = 3. The total number of objects 
in that area is 235,231. In order to assess the photometry 



Column Name Type Description 



objID* 


long 


Object ID 


objname 


char 


Object Name 


ra* 


float 


R.A. in degrees (deg, J2000) 


dec* 


float 


Dec in degrees (deg, J2000) 


glon 


float 


Galactic longitude (deg) 


glat 


float 


Galactic latitude (deg) 


coremag_r 


float 


r' aperture corrected magnitude 


coremagj 


float 


i' aperture corrected magnitude 


coremag_ha 


float 


Ha aperture corrected magnitude 


coremagerr_r 


float 


r' magnitude error 


coremagerrj 


float 


i' magnitude error 


coremagerr_ha 


float 


Ha magnitude error 


rmi* 


float 


r'- i' colour 


rmha* 


float 


r'- Ho colour 


rmierr 


float 


r'- i' colour error 


rmhacrr 


float 


r'- Ha colour error 


class_r 


int 


r' star/galaxy clasification index 


class j 


int 


i' star/galaxy clasification index 


classjia 


int 


Ha star/galaxy clasification index 


cllipt_r 


real 


ellipticity as measured in r' 


elliptj 


real 


ellipticity as measured in i' 


ellipt_ha 


real 


ellipticity as measured in Ho 


posang_r 


real 


position angle as measured in r' 


posangj 


real 


position angle as measured in i' 


posang_ha 


real 


position angle as measured in Ha 


seeing_flag* 


int 


flag sources with seeing <1.7 


photom_flagt 


int 


zero if object detected in all bands 


class_flag* 


int 


combined probabily of stellar object 


chip_r_id* 


long 


link to r' chip table 


chipjjd* 


long 


link to i' chip table 


chip_ha_id* 


long 


link to Ha chip table 


cxt 


float 


Spherical coordinate x=cos(o)sm((5) 


cyt 


float 


Spherical coordinate y=sin[a)sin{S) 


czt 


float 


Spherical coordinate z=cos(5) 


htmid* 


long 


20-doep HTM index of this object 


fieldJd* 


long 


link to field table 



Table 6. Column names, types and description of main photo- 
metric tables PhotoObj and PhotoObjBest. Columns marked 
with an asterisk are indexed. Columns marked with f are not 
included in the PhotoObjBest table. 



Index Meaning Probability (%) 

Star Extended Noise Saturated 



-9 


Saturated 








5 


95 


-3 


Probable extended 


25 


70 


5 





-2 


Probable star 


70 


25 


5 





-1 


Star 


90 


5 


5 








Noise 


5 


5 


90 





1 


Extended 


5 


90 


5 






Table 7. Meaning of classification index and probability assigned 
to each of them. 



we have grouped all the objects b y field and used the field 
overlap algorithm as described in iGlazebrook et al.l (|l994h 
in order to offset the photometry to a common reference 
system. 

For this exercise we have just set one frame as the ref- 
erence calibrated frame and allowed the others to vary and 
solved the matrix equation for each band. The resultant 
magnitude offsets between the different fields is smaller than 
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Figure 14. Positional offsets in R.A. (left) and Dec (right) be- 
tween GPS and IPHAS positions (arcsec) in one of the fields 
included in the area considered in the text. 



~0.03 in the three bands confirming the good photometric 
calibration. 

For the same area we extract also the near-IR mag- 
nitudes from the UKIDSS GPS survey. The GPS is carry- 
ing out a complementary survey of the galactic plane in J, 
H and K. Using AstroGrid access to the WFCAM Science 
Archive (WSA) we download the near-IR catalogue of ob- 
jects detected in the area covered by the IPHAS observa- 
tions. We then crossmatch the GPS and IPHAS catalogues 
selecting the nearest neighbour in a radius of 1.2 arcsec us- 
ing STILT^. Figure [TJ] shows the positional offsets between 
the IPHAS and GPS catalogues for one field. The offset is 
effectively zero and the rms is 0.13" and 0.12" in Ao and AS 
respectively. This good agreement is not coincidence since 
both surveys use 2MASS as the reference astrometric sys- 
tem and the WFCAM processing pipeline utilises the same 
algorithms developed at CASU for the optical processing. 
It is however worth noting the success of the astrometric 
pipeline in such different data products. 

The importance of t he (r' — Ha, r' — i' ) plane has been 
extensively discussed in iDrew et al.l (|2005l ) . The mean se- 
quence is clearly defined and sources above it are emission 
line objects (see figure [TS}. We explore here the (r' — Ha, 
r' — K) plane provided by the combination of IPHAS and 
GPS. 

Using the SED model library from IPicklesI (|l998l ) 
and the WFCAM filter profiles we have simulated the 
co lours of normal sta rs. The detailed process is described 
in iDrew et al] (|2005l ) and Sale et al. 2008 (in prep) and 
we list in table |H] the synthetic colours for dwarfs and gi- 
ants only. The tracks are plotted in figure [15] for extinc- 
tion between zero and four. Figure [161 shows the (r' — Ha, 
r' — K) and (r' — Ha, r' — i') colour-colour diagrams with 
E{B - 1/) = and E{B - V) ^ 2 reddening tracks for 
dwarfs and giants overplotted. We have included in these 
plots only sources classified as point-like in the optical and 
near-IR data (class_r=-l , -2, pStar>0 . 7) with magnitude 
errors smaller than 0.04 in r' and K. We have also excluded 



objects brighter than r' =13 and Ti" = 12 to avoid satura- 
tion and finally we have removed all sources within 10 arcsec 
of any WFCAM chip edge (although problematic sources in 
the edges are anyway flagged by the pipeline classifier). 

The main sequence of objects at zero reddening is well 
defined in the {r' — Ha, r' — K) plane with a large population 
of objects up to the M stars. These data are for fields in the 
Cygnus-X part of the plane where there is a local essentially 
unreddened stellar population to a few hundred parsecs, and 
then a decline in stellar density and sharp rise in molecular 
gas and extinction to a distance of a kpc or so. Beyond 
this, the stellar density recovers as distant associations are 
traversed but of course it presents as a much more heavily 
reddened population. Furthermore stars later than spectral 
type mid-K are intrinsically faint and are only detectable 
at E{B — V) < 0.8 in the figure [16] where our limit is 
at r' — K < 7. This explains the lack of K and M dwarfs 
reddened objects in these diagrams. 

Note that no artificial offsets have been introduced to 
the synthetic tracks to make them fit with the observed 
colours. This is further confirmation of the good photometric 
calibration of both the IPHAS and GPS catalogues. 



6 USER ACCESS 

User access to the catalogues and images has been provided 
by the AstroGrid project. There are two separate user in- 
terfaces to the IPHAS IDR. Users who only want to extract 
fluxes or images around a small position of sky (there is a 
limit on the radius of 1 degree) will find the web interface 
usefu{f|. Given a position or an object name the user will ob- 
tain a cone search of the PhotoObjBest catalogue in a va- 
riety of output formats (VOTable, Binary VOTable, Comma 
Separated Value or HTML). Input lists of objects are also 
supported. Together with the IPHAS catalogue we provide 
access from the same interface to other useful VO com- 
plian t archives like SDSS DR5 (jAdelman-M cCart hv et al 
20071 ). 2MASS Point Source Catalogue (jSkrutskie et al 



20061 ) ■ NED and the Car lsberg Meridian Astrometric Cata- 
logue ([Evans et al.ll200^ ). Postage stamps generation for a 
particular input position is also available using the finding 
chart utility (see example output in figure [T7[) . 

Users who want more sophisticated queries on flags and 
algebraic combinations of parameters (e.g. colours) will find 
the AstroGrid Query Builder more suited to their needs. 
The Query Builder is an application inside the AstroGrid 
VO Desktop which allows to submit an arbitrary query to 
the database system. The Query Builder facilitates the task 
of building complex queries written in Astronomical Dataset 
Query Language (ADQllfJ; an international agreed standard 
based on the Structured Query Language - SQL), save the 
queries and submit them asynchronously to the server (this 
facilitates sending long running queries and retrieve the re- 
sults later). 



STILTS is a set of command line tools to per- 
form operations with tables like cross matching * Available from'http://www. iphas.org/idr 
jhttp : / / www. star .bris.ac.uk/ -^-^mbt /stilts/^ ^ http://www.ivoa.net/Documents/latest/ADQL.html 
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r3577B3.flt[4] 



r367782.fit[4] 

r 



r367781,fit[4] 
Habha 




Figure 17. Example postage stamps obtained using the finding chart utility. Each stamp is labeled with the run number, CCD number 
and filter. The object corresponds to the Principe de Asturias quadrupolar nebula presented in Mampaso et al. (2006). 
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Spectral Reddening 
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Table 8. Synthetic tracks in the (r' — J, J — K) plane, for main sequence dwarfs and giants, calculated for a range of reddening. A 
similar table with optical colours only can be found in table 2 of Drew et al. 2005. 
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6.1 Image Access 

The Simple Image Access Protocol (SIAP) defines a pro- 
totype standard for retrieving image data from a variety 
of astronomical image repositories through a uniform in- 
terfac^EI. The IPHAS images are available through a SIAP 
(ivo://uk.ac.cam.ast/IPHAS/images/SIAP) interface. The 
result of a query is, given a box centred at coordinates RA, 
Dec and a box size in degrees, a table of image CCDs which 
overlap the defined box. The table contains links to the pro- 
cessed image file itself. Alternatively the same list of images 
can be obtained with an appropriate SQL query (see be- 
low). At the moment only full image retrieval is supported. 
Although the graphical interface does not allow for a list of 
positions this is possible to accomplish from the command 
line interface. 

6.2 Full Catalogue Access 

The ingestion of all the catalogue data into a database pro- 
vides the capability of using SQL in order to query the cata- 
logue with any user defined constraint. The AstroGrid Data 
Set Access (DSA) provides a layer on top of any database 
system which makes it readily VO compliant and accessi- 
ble from any VO compliant application. The examples in 
appendix A provide ADQL queries that can be used in the 
Query Builder to retrieve a variety of data. 

6.3 Command line access 

The AstroGrid Run Time (AR) provides a bridge between 
the user command line tools or programs and the VO. Cata- 
logues can be accessed using this bridge from programming 
languages like Python, Perl, C or Java. There are several 
reasons to provide a command line interface to the VO. The 
more important one is the ability to perform batch queries 
for a list of objects and also to implement a call to a VO ser- 
vice inside a user written script that performs some analysis 
afterwards. 



7 SUMMARY 

We have presented the Initial Data Release of the IPHAS 
project. The total release contains photometry in r', i' , and 
Hq for about 200 million objects in the northern galactic 
plane comprising the largest photometric sample available 
imaged in narrowband Ha. We have also made available a 
subset of this catalogue, PhotoObjBest, which contains 
the best observations in terms of final survey quality as de- 
fined in the text and which comprises about 60 per cent of 
the final release. The legacy of the survey is increased by 
the extensive coverage appearing from other projects span- 
ning the wavelength range range from X-rays to near-IR and 
sub- millimeter. 

The data have been made available through a VO com- 
phant interface provided by AstroGrid and accessible to all 
VO projects. 

This initial release will be supplemented with data ob- 
tained from January 2006 onwards at a later date to be 

^ See |http: / / www.ivoa.net /Documents /latest/SIA.html | 
© 2008 RAS, MNRAS OOO.mnTl 



announced in the project web pages. At the time of writing, 
data-taking continues. There will also be a separate Final 
Release once the survey is entirely complete and all finally- 
accepted data have been corrected onto a fully-interlaced 
uniform photometric scale. 

More information and examples are available 
from the IPHAS IDR and AstroGrid websites at 
^http: / / www. iphas.org/idr I and ,http: / /www. astrogrid.org, 
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APPENDIX A: EXAMPLE USAGE ACCESS 

Example 1: Return coordinates, magnitudes and magnitude 
errors of objects in a rectangular area. 

This is the most basic query returning all objects in a 



specific box with R.A. between 300 and 301 degrees and Dec 
between 30 and 31 degrees. 

SELECT P.ra, P. dec, 

P . coremag_r , P . coremag_i , P . coremag_ha, 

P . coremagerr_r , P . coremagerr_i , P . coremagerr_ha 

FROM PhotoObjBest as P 

WHERE P.ra BETWEEN 300.0 AND 301.0 
AND P. dec BETWEEN 30.0 and 31.0 

Example 2: Return coordinates and magnitudes of objects in 
a cone defined by its center at RA=300.0 and Dec=30.0 and 
radius r—10 arcmin. 

In this case instead of coding the whole trigonometry 
in the query we use the auxiliary table tGetNearbyObj 
which returns the object IDs which satisfy a cone search 
specified by the center coordinates (ra, dec) and radius (r) 
in minutes of arc: 

SELECT P.ra, P. dec, 

P . coremag_r , P . coremag_i , P . coremag_ha, 
P . coremagerr_r , P . coremagerr_i , P . coremagerr_ha 
FROM PhotoObjBest as P, tGetNearbyObj as G 
WHERE G.ra=300.0 AND G.dec=30.0 AND G.r=10.0 
AND P.objID = G.objID 

Example 3: As previous example but only return objects clas- 
sified as point-like in r and observed in conditions of seeing 
best than 1.5 arcsec as measured in the r band. 

In this query we are asking for information about the 
quality of observations and this is stored in the ChipsAU 
table. We need therefore to join the results with this table in 
order to select only observation with good seeing conditions. 

SELECT P.ra, P. dec, 

P . coremag_r , P . coremag_i , P . coremag_ha, 

P . coremagerr_r , P . coremagerr_i , P . coremagerr_ha 

FROM PhotoObjBest as P, tGetNearbyObj as G, 
ChipsAll as CR 

WHERE G.ra=300.0 AND G.dec=30.0 AND G.r=10.0 

AND CR.seeing<1.5 AND (P.class_r=-1 OR P.class_r=- 
AND P.objID = G.objID AND P . chips_r_id=CR. chipID 

Adding additional constraints on e.g. object colours, 
magnitude limits, observation dates are trivially imple- 
mented in the same way, e.g.: 

SELECT P.ra, P. dec, 

P . coremag_r , P . coremag_i , P . coremag_ha, 

P . coremagerr_r , P . coremagerr_i , P . coremagerr_ha 
FROM PhotoObjBest as P, tGetNearbyObj as G, 

ChipsAll as CR 
WHERE G._ra=300.0 AND G._dec=30.0 AND G._r=10.0 

AND P . coremag_r-P . coremag_ha>0 . 8 

AND P . coremag_r-P . coremag_i>l . 5 

AND P . coremagerr_r<0 . 02 

Example 4- Select all objects observed in a particular offset 
field. 

The following example returns all objects detected in 
a particular observation from the (not cleaned) PhotoObj 
catalogue. 

SELECT * FROM PhotoObj as P 

JOIN FieldsAll as F ON (F.fieldID = P.field_id) 
WHERE F.fieldiio=7012 AND F.onoff=0 
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Example 5: Return the number and position of fields observed 
in the best photometric conditions. 

Section 14.21 explains the constrains used to define the 
catalogue of best observations. The following query returns 
the number, position and photometric characteristics of the 
fields which satisfy those conditions. 

SELECT 

CC. night, F.fieldID, F.fieldno, F.onoff, 
F.ra, F.dec, F.glon, F.glat, 
T.magzpt_mean, T.magzrr_mean, 
CC. seeing, CC. elliptic 

FROM 

( SELECT 

C . night , 

sum (C .magzpt) /count (C .magzpt) as magzpt_mean, 
sum (C .magzrr) /count (C .magzrr) as magzrr_mean 

FROM FieldsAll F, ChipsAll C 

WHERE 

C.field_id = F.fieldID AND C.band='r' 

AND C.chipno=4 
GROUP BY night) AS T, 
ChipsAll CC, FieldsAll F 
WHERE 

T. night = CC. night AND CC.field_id = F.fieldID 
AND CC.chipno=4 AND CC.band='r' AND CC.seeing<2 
AND CC.elliptic<0.2 AND T.magzpt_mean>24.2 
AND T.magzrr_mean<0.05 
ORDER BY night 



The IDR web site (http://idr.iphas.org I provides fur 



ther information as well as examples on accessing the cata- 
logue from the command line. 



This paper has been typeset from a Tp^X/ M^rjK file prepared 
by the author. 
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